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INTRODUCTION 


The  noise  signature  of  a  distant  helicopter  when  first  aurally 
detected  by  an  unaided  human  observer  in  the  field  is  usually 
not  much  greater  in  level  than  the  background  noise  in  the 
observer's  vicinity.  As  small  fluctuations  in  signal  levels 
(due  to  the  vagaries  of  long  range  propagation  of  the  heli¬ 
copter's  acoustic  emissions  through  an  inhomogeneous  atmosphere) 
and/or  fluctuations  in  ambient  noise  levels  (due  to  random 
variations  in  the  background  noise  process)  perturb  the  signal 
to  noise  ratio  at  the  observer's  ear,  the  helicopter  is  dis- 
continuously  audible. 

For  example,  when  atmospheric  conditions  favor  propagation  of 
acoustic  energy  in  the  frequency  range  in  which  a  helicopter  is 
maximally  detectable  for  a  few  hundred  milliseconds,  and  when 
background  noise  levels  happen  to  be  simultaneously  low,  a 
distant  helicopter  may  be  momentarily  audible.  Moments  later, 
however,  a  moving  helicopter  may  become  inaudible  and  remain 
so  for  several  seconds  for  any  of  a  large  number  of  reasons. 
These  include  changes  in  source  levels  during  various  flight 
maneuvers  (Reference  1),  shielding  from  terrain  features, 
random  wind  gusts,  etc. 

Algorithms  for  predicting  ranges  at  which  helicopters  first 
become  audible  should  therefore  recognize  the  intermittent  and 
short  duration  of  audibility.  This  in  turn  requires  consi¬ 
deration  of  the  effects  of  duration  of  human  audibility  of 


1Lowson,  M.  V.  ,  and  Ollerhead,  J.  B. ,  STUDIES  OF  HELICOPTER  ROTOR 
NOISE,  USAAVLABS  TR  68-60,  U.S.  Army  Aviation  Materiel  Labora¬ 
tories,  Fort  Eustis,  Virginia,  January  1969,  AD  684394. 
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acoustic  signals,  and  of  the  interactions  between  duration  and 
signal  frequency. 


This  report  ^escribes  research  undertaken  to  determine  whether 
the  Applied  Technology  Laboratory's  existing  procedures  for 
predicting  detection  ranges  of  helicopters  require  revision  to 
account  for  any  potential  differences  in  the  time  period  over 
which  people  integrate  signal  energy  at  different  frequencies. 
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LITERATURE  REVIEW 


The  most  common  framework  for  considering  effects  of  duration 
on  the  ear’s  sensitivity  to  acoustic  signals  rests  on  the 
assumption  that  the  ear  integrates  the  power  of  an  acoustic 
signal  over  a  fixed  period  of  time.  The  period  of  integration 
is  usually  referred  to  as  the  time  constant  of  the  ear.  Zwis- 
locki's  mathematical  model  of  "temporal  auditory  summation" 
(Reference  2)  provides  a  useful  framework  for  the  following 
discussion: 


10  log  (It/IJ  =  10  log[  1/ (l-e-t/T )  ]  Eq.  1 

where  is  the  signal  power  (e.g.,  in  watts)  required  for 
detection  of  a  tone  of  duration  t,  Itt  is  the  power  required  for 
detecting  the  same  tone  of  indefinitely  long  duration,  and  t  is 
the  time  constant. 

Equation  1  models  sensory  processes  that  occur  up  to  a  few 
hundred  milliseconds  after  the  onset  of  stimulation.  The  time 
constant  (x)  of  Equation  1  represents  the  amount  of  time  that 
must  pass  before  near-steady-state  conditions  are  reached. 

Within  this  time  period,  a  doubling  of  signal  duration  improves 
detection  performance  by  approximately  3  decibels.  In  other 
words,  3  dB  less  signal  to  noise  ratio  is  needed  in  this  tem¬ 
poral  regime  to  detect  a  signal  twice  as  long  as  another. 

For  prolonged  stimulation  (i.e.,  signal  durations  much  longer 
than  x),  cognitive  rather  than  sensory  factors  govern  the 


2Zwislocki,  J.  J.,  THEORY  OF  TEMPORAL  AUDITORY  SUMMATION,  Journal 
of  the  Acoustical  Society  of  America,  32,  1046,  I960. 
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ability  of  human  observers  to  make  detection  decisions.  These 
nonsensory  factors  are  discussed  more  fully  on  page  34  et  seq. 
of  this  report.  For  reasons  discussed  more  fully  later,  a 
doubling  of  signal  duration  in  this  temporal  regime  improves 
detection  performance  by  only  about  1.5  decibels. 

Although  most  researchers  subscribe  to  this  general  view,  there 
is  "remarkably  little  agreement  about  the  duration  of  the  ear's 
time  constant"  (Reference  3).  Fidell  et  al.  (Reference  3) 
provide  the  following  brief  summary  of  estimates  of  the  ear’s 
time  constant:  "One  of  the  first  to  investigate  the  time  con¬ 
stant  of  the  ear  was  von  Blk£sy  (Reference  4).  By  varying  the 
duration  of  an  0.800  kHz  sinusoid,  he  concluded  that  about  180 
msec  was  needed  for  it  to  reach  its  maximum  loudness.  Munson 
(Reference  5)  used  frequencies  of  0.125,  1.000,  and  5.650  kHz  and 
estimated  the  time  constant  to  be  about  250  milliseconds.  Miller 
(Reference  6)  used  broadband  noise  and  estimated  the  time  con¬ 
stant  to  be  about  65  milliseconds  at  the  higher  sensation  levels. 


■7 

JFidell,  Sanford,  Karl  S.  Pearsons,  Mario  Grignetti  and  David  M. 
Green,  THE  NOISINESS  OF  IMPULSIVE  SOUNDS,  Journal  of  the  Acous¬ 
tical  Society  of  America,  48,  No.  6,  Part  1,  1970. 

Von  B£k£sy ,  G.,  ZUR  THEORIE  DES  HORENS.  UEBER  DIE  BESTIMMUNG 
DES  EINEM  REINEN  TONEMPFINDEN  ENTSPRECHENDEN  ERREGUNGSEBIETES 
DER  BASILARMEMBRAN  VERMITTELS  ERMUDUNGSERCHEINUNGEN ,  Phys.  Z., 
30,  115-125,  1929. 

^Munson,  W.  A.,  THE  GROWTH  OF  AUDITORY  SENSATION,  Journal  of  the 
Acoustical  Society  of  America,  19,  584-591,  1947. 

^Miller,  G.  A.,  PERCEPTION  OF  SHORT  BURSTS  OF  NOISE,  Journal  of 
the  Acoustical  Society  of  America,  20,  160-170,  1948. 
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Small,  Brandt,  and  Cox  (Reference  7)  performed  a  similar  experi¬ 
ment,  but  found  consistently  smaller  values.  At  high  levels 
the  time  constant  estimated  by  Small  et  al.  appears  to  be  in 
the  range  of  10-20  milliseconds.  Von  Port  (Reference  8)  investi¬ 
gated  a  variety  of  brief  impulsive  sounds.  Most  were  gated  noises 

of  different  bandwidths,  center  frequencies,  and  levels.  He 
estimated  that  70  milliseconds  was  a  reasonable  value  of  the  time 

constant  for  a  variety  of  stimuli.  Von  Niese  (Reference  9)  has 
summarized  a  number  of  studies  from  the  Dresden  Laboratories. 

He  concluded  that  25  milliseconds  was  a  good  average  time  con¬ 
stant  value  and  argued  that  this  estimate  agreed  nicely  with 
reverberation  time  estimates.  Von  Zwicker  (Reference  10)  took 
issue  with  the  conclusions  of  the  Dresden  group  and  presented 
more  data  from  the  Stuttgart  Laboratory  consistent  with  the 
longer  estimate,  approximately  100  milliseconds.  Stevens  and 
Hall  (Reference  11),  using  a  magnitude  estimation  technique, 
measured  the  time  constant  at  about  150  milliseconds. 


^ Small,  A.  M. ,  Jr.,  J.  F.  Brandt,  and  P.  G.  Cox,  LOUDNESS  AS  A 
FUNCTION  OF  SIGNAL  DURATION,  Journal  of  the  Acoustical  Society 
of  America,  34,  513,  1962.  . . 

O 

von  Port,  E.,  UBER  DIE  LAUTSTARKE  EINSELNER  KURZER  SCHALLIM- 
PULSE,  Acustica,  13,  211-223,  1963. 

^ von  Niese,  H.,  METHODE  ZUR  BESTIMMUNG  DER  LAUTSTARKE  BELIEBIGER 
GERAUSCHE,  Acustica,  15,  117-126,  1965. 

10von  Zwicker,  E. ,  EIN  BEITRAG  ZUR  LAUTSTARKEMESSUNG  IMPULSHAL- 
TIGER  SCHALLE,  Acustica,  17,  11-22,  1966. 

11Stevens ,  J.  C.,  and  J.  W.  Hall,  BRIGHTNESS  AND  LOUDNESS  AS  A 
FUNCTION  OF  STIMULUS  DURATION,  Perception  and  Psychophysics. 

1,  319-327,  1966.  - - - - 
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No  one  factor  accounts  for  this  variability  in  estimates  of  *  in- 
ear's  time  constant,  although  factors  such  as  experimental  pro¬ 
cedures,  instructions  to  observers,  and  individual  differences 
probably  account  for  a  fair  amount  of  the  variability.  Watson 
and  Gengel  (Reference  12)  contend  that  a  major  part  of  the  vari¬ 
ability  may  be  attributed  to  genuine  differences  in  the  duration 
of  the  time  constant  as  a  function  of  frequency.  Table  1, 
adapted  from  Reference  12,  catalogs  a  number  of  studies  from 
which  Watson  and  Gengel  conclude  that  "the  form  of  the  (temporal; 
integration  function  has  been  found  to  change  with  frequency, 
in  that  the  time  constant  is  reduced  as  frequency  is  increased 
in  all  experiments  that  satisfy  the  following  conditions: 

A)  The  range  of  signal  durations  investigated  is  wide  enough 
to  allow  significant  (as,  10-20  dB)  differences  between  short- 
tone  and  long-tone  thresholds; 

B)  A  wide  enough  range  of  frequencies  is  investigated,  gene¬ 
rally,  a  range  including  at  least  the  frequencies  250-4000  Hz; 
and 

C)  A  relatively  precise  psychophysical  procedure  is  used." 

In  summary,  there  is  good  reason  to  believe  that  the  time  con¬ 
stant  of  the  ear  is  on  the  order  of  200-300  milliseconds,  and 
that  it  varies  inversely  with  signal  frequency.  No  information 
is  available  about  the  duration  of  the  time  constant  at  fre¬ 
quencies  below  100  Hz.  Watson  and  Gengel  (Reference  12,  Figure 
6)  present  evidence  that  the  time  constant  varies  from  about  150 
milliseconds  at  125  Hz  to  about  50  milliseconds  at  8  kHz,  a  range 
of  roughly  5  dB. 

12Watson,  Charles  S.,  and  Roy  W.  Gengel,  SIGNAL  DURATION  AND 
SIGNAL  FREQUENCY  IN  RELATION  TO  AUDITORY  SENSITIVITY,  The 
Journal  of  the  Acoustical  Society  of  America,  46,  No.  4 , 


TABLE  1.  SUMMARY  OF  STUDIES  OF  MONAURAL  TIME-INTENSITY 


Studies  In  which  definite  frequency  effect  Is  found  are  those  in  which  the  integration  time 
clearly  decreases  as  a  function  of  frequency. 


TABLE  1.  Continued. 
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RATIONALE  FOR  CURRENT  INVESTIGATION 


The  purpose  of  the  current  study  was  to  test  a  key  as sump 
a  prior  study  undertaken  for  the  Applied  Technology  Labor 
The  prior  study  (Reference  27)  attributed  differences  in 
to  noise  ratio  necessary  to  maintain  constant  detection  p 
mance  at  various  frequencies  to  frequency  domain  effects. 
"Effective  masking  bandwidths"  were  estimated  from  the  la 
Fidell  et  al.  (Reference  27)  by  assuming  that  all  other  4 
being  equal,  observed  differences  in  signal  to  noise  r:v  ' 
necessary  to  maintain  76%  correct  detection  per  forma  no-  ' 
two-alternative  forced  choice  task  in  which  st  s,<  i  vr. 
a  sinusoid  of  various  frequencies  in  noise  could  b.  • ' •  r: 
to  differences  in  masking  bandwidths  at  the  various  ♦,r<  n: 
This  assumption  is  essential  to  Fletcher's  (hei'e?-<  no-.  .  •• ) 
"critical  ratio"  argument  used  by  Fidell  et  al.  ( h •■!*.•  r • . 


Because  the  750  msec  duration  of  the  sinusoids  dot  i 
Reference  27  was  much  longer  than  the  time  constant  of 
it  was  not  possible  to  determine  whether  the  frequency 
dence  of  the  ear's  time  constant  affected  the  estimate.: 
masking  bandwidths  inferred  by  the  critical  rat  io  im  *  !.> 
overall  intent  of  the  current  research  was  to  del  erml  in¬ 
differences  in  integration  time  at  different  frequen- 1  • 
revision  of  earlier  estimates  of  masking  bandwidths. 


:U 


Fidell,  S.  A.,  R.  D.  Horonjeff  and  D.  M.  Green,  LOW  KH: 
ACOUSTIC  DETECTION  RESEARCH  IN  SUPPORT  OF  HUMAN  PET!-  ’TI 
PREDICTION,  USARTL-TR-79-25,  U.S.  Army  Research,  and  T.-c 
Laboratories,  Fort  Eustis,  Virginia,  October  iQ7q,  AL  A 

P  ft 

Fletcher,  H.,  SPEECH  AND  HEARING  IN  COMMUNICATION,  N,-w 
Van  Nostrand,  1953* 
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The  principal  goals  of  the  current  study  were  therefore  to 
extend  existing  information  about  the  duration  of  the  ear's 
time  constant  to  lower  frequencies,  and  to  determine  the 
relationship  between  integration  time  at  these  lower  fre¬ 
quencies  and  at  higher  frequencies.  This  relationship  must 
be  known  in  order  to  assess  the  validity  of  the  "all  other 
things  being  equal"  assumption  that  permitted  inference  of 
effective  masking  bandwidths  in  the  frequency  domain  in 
Reference  27. 


METHOD 


PILOT  STUDY 


A  pilot  study  was  undertaken  as  a  preliminary  investigation  of 
the  effects  of  signal  duration  on  detectability  at  various  fre¬ 
quencies.  The  pilot  study  took  advantage  of  an  ongoing  study  of 
the  symmetry  of  the  auditory  filter  supported  by  the  U.S.  Army 
Tank-Automotive  Research  and  Development  Command  (Reference  29). 
The  goal  of  this  study  was  to  investigate  the  shape  of  the  hypo¬ 
thetical  auditory  filter  through  which  people  are  assumed  to 
detect  signals  embedded  in  noise. 

The  logic  of  the  TARADCOM  study  differed  from  that  of  Reference 
27  in  that  masking  bandwidths  were  measured  directly  rather  than 
inferred  by  the  critical  ratio  argument.  The  masking  bandwidths 
were  measured  by  forcing  observers  to  listen  for  a  sinusoid 
within  a  gap  in  the  background  noise  spectrum  that  was  dis¬ 
continuous  in  the  frequency  domain.  Digitally  manufactured 
noise  with  precisely  specified  band  limits  and  exceptionally 
steep  skirts  constrained  the  observers’  listening  strategies. 

The  trial  procedure  was  a  two-alternative  forced  choice  detection 
task  in  blocks  of  50  trials  with  a  constant  signal  to  noise 
ratio.  Detection  performance  at  each  frequency  was  assessed  at 
several  different  signal  to  noise  ratios.  Best  fitting  regres¬ 
sion  lines  to  these  data  were  obtained  for  each  observer  and  each 
frequency  by  calculating  least  square  regression  solutions  to 
points  in  the  linear  portions  of  the  psychometric  functions. 


2^Fldell,  S.  A.,  R.  D.  Horonjeff,  S.  R.  Teffeteller,  and  D.  M. 
Green,  REVISION  OF  ACOUSTIC  DETECTION  RANGE  PREDICTION  MODEL 
BASED  ON  PSYCHOACOUSTIC  STUDY  OF  LOW  FREQUENCY  MASKING,  Bolt 
Beranek  and  Newman  Report  4421,  June  1980. 
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The  slopes  and  Intercepts  of  these  regression  equations  were 
then  averaged  over  all  observers  at  each  frequency  to  yield  best 
fitting  regression  lines  in  order  to  estimate  differences  in 
signal  to  noise  ratios  necessary  to  maintain  constant  detection 
performance.  All  testing  for  the  TARADCOM  study  was  conducted 
at  a  single,  fixed  signal  duration  of  250  milliseconds. 

The  pilot  study  was  conducted  in  conjunction  with  the  TARADCOM 
study  to  provide  an  independent  check  on  the  critical  ratio 
estimates  of  masking  bandwidths  of  Reference  27.  Observers  were 
tested  at  additional  signal  durations  of  100,  125,  and  1000 
milliseconds .  All  testing  was  conducted  at  124  Hz  with  a  con¬ 
tinuous  background  noise  spectrum.  That  is,  the  noise  did  not 
contain  a  discontinuous  spectral  notch  in  the  frequency  domain. 
Estimates  of  signal  to  noise  ratios  necessary  to  maintain  con¬ 
stant  detection  performance  at  different  signal  durations  were 
used  as  an  independent  check  on  estimates  derived  from  the  main 
study . 

MAIN  STUDY 


Five  observers  (of  22.2  years  average  age)  were  paid  at  the  rate 
of  $3-50  per  hour  to  investigate  possible  differences  in  detec¬ 
tion  performance  caused  by  differences  in  integration  time  at 
different  frequencies.  (Observers  in  the  previous  low  fre¬ 
quency  acoustic  detection  study  conducted  for  the  Applied  Tech¬ 
nology  Laboratory  [Reference  27]  detected  sinusoids  at  1000, 

500,  250,  125,  63,  and  40  Hz  in  a  two-alternative  forced  choice 
detection  task  at  a  fixed  signal  duration  of  750  milliseconds.) 
An  incomplete  factorial  design  of  combinations  of  frequencies 
and  durations  was  used  in  this  study. 

The  range  of  signal  durations  was  chosen  from  50  to  1000  milli¬ 
seconds.  A  priori  information  suggested  that  the  integration 
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time  of  the  human  auditory  system  is  approximately 
seconds.  Thus,  three  durations  were  chosen  for  testing  or: 
either  side  of  this  range.  These  durations  included  DC,  in-'  , 
200,  500,  750.  and  1000  milliseconds. 

Signal  durations  shorter  than  50  milliseconds  were  avoided  for1 
two  reasons.  First,  as  noted  by  Garner  and  Miller  1 n  1 9 ^ 7 
(Reference  15),  rapid  gating  of  a  sinusoid  at  shorter  durations 
spreads  signal  energy  into  spectral  regions  higher  than  the 
fundamental  frequency  of  the  sinusoid,  thus  complicating  analyse 
of  temporal  domain  effects.  Second,  signals  with  durations 
shorter  than  50  milliseconds  are  essentially  impulses ,  wL 
detectability  may  not  be  well  predicted  by  the  steady-si  at- 
equations  built  into  the  Applied  Technology  Laboratory's  rar.r- 
prediction  software  (Reference  30). 

Table  2  contains  the  combinations  of  sinusoids  and  durations 
heard  by  the  five  observers  in  the  main  study.  The  ord-r  in 
which  observers  encountered  sinusoids  of  varying  durations  war 
randomized,  except  at  40  Hz.  Data  were  collected  last  at  this 
frequency  due  to  delays  for  equipment  repairs  and  recalibrat i on 
of  the  sound  reproduction  system. 


30Abrahamson,  A.  Louis,  CORRELATION  OF  ACTUAL  AND  ANALYTICAL 
HELICOPTER  AURAL  DETECTION  CRITERIA,  Vol  .  ]  ,  Uf.AAMRD:  -?}•-"•<- 
102A,  U.S.  Army  Air  Mobility  Research  and  1)<  ve  1  < .nm  nU  Lat 
tory,  Fort  Eustis,  Virginia,  January  1971’,  A I  h  >  "L. 
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TABLE  2.  SINUSOIDAL  SIGNAL  FREQUENCIES,  BACKGROUND  NOISES 
AND  DURATION  CONDITIONS  TESTED 

Narrow  Band  Wide  Band 

Background  Noise  Background  Noise 


40 

6° 

125 

125 

250  1000 

Hz 

Hz 

Hz 

Hz 

Hz 

Hz 

Frequency 

Duration 

(ms) 

50 

X 

X 

X 

X 

X 

100 

X 

X 

X 

X 

X 

200 

X 

X 

X 

X 

X 

X 

500 

X 

X 

X 

X 

X 

750 

X 

X 

X 

X 

X 

X 

1000 

X 

X 

X 

X 

X 

Sinusoids  at 

500 

Hz  were  not 

Included  in  the 

detection 

task  of 

the  current 

study  although  sinusoids 

at  this 

frequency 

were 

included  in 

the 

prior 

study 

for  the 

Applied 

Technology 

Labo- 

ratory  (Reference  27).  Sinusoids  at  500  Hz  were  omitted  since 
the  data  collected  at  125,  250  and  1000  Hz  varied  by  only  a 
small  degree.  Preliminary  findings  suggested  that  the  greatest 
difference  In  Integration  time  of  the  human  auditory  system 
would  be  observed  at  low  frequencies.  Thus,  greater  emphasis 
was  placed  on  detection  tasks  below  125  Hz. 

New  software  was  developed  to  provide  trialwise  adaptive  ad¬ 
justment  of  signal  presentation  levels.  Trlal-by-trial  adjust¬ 
ment  of  signal  to  noise  ratios  was  adopted  because  of  its  great¬ 
er  efficiency  relative  to  blockwlse  adjustment  of  signal  to 
noise  ratios.  In  the  prior  study  conducted  by  BBN  for  the 
Applied  Technology  Laboratory  (Reference  27),  data  were  col¬ 
lected  under  32  test  conditions  (combinations  of  signal  fre¬ 
quencies  and  durations);  thus  a  more  efficient  means  of  col¬ 
lecting  data  was  required. 
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In  the  trialwise-adaptive  two-alternative  forced  choice  signal 
detection  procedure,  the  level  of  the  signal  can  he  adjusted 
after  each  detection  decision.  The  method  was  drawn  from  the 
family  of  block  up-and-down  methods  (Reference  31)  in  which  the 
level  of  the  signal  is  adjusted  after  a  subset  of  trials.  For 
example,  in  the  "2  down,  1  up"  procedure,  the  computer  lowers 
the  signal  to  noise  ratio  following  each  incorrect  decision. 

The  program  can  also  look  for  a  "3  down,  1  up"  subset  of  trials 
in  which  three  correct  detection  judgments  are  required  at  a 
given  signal  to  noise  ratio  before  lowering  the  level  of  the 
signal.  The  "2  down,  1  up"  procedure  estimates  71  percent 
correction  detection  performance  from  the  test  subject  while 
the  "3  down,  1  up"  method  estimates  79  percent  correct  detec¬ 
tion. 

The  current  experiment  was  set  up  to  define  a  76  percent  correct 
level.  To  obtain  this  level  of  performance,  the  computer  ini¬ 
tially  waited  for  two  correct  responses  to  lower  the  level  of 
the  signal.  Thereafter,  the  computer  randomly  chose  between  two 
and  three  correct  responses  before  lowering  the  level  of  the 
signal.  Further  details  on  the  experimental  operating  pro¬ 
cedures  may  be  found  in  Appendix  A  along  with  sample  line  print¬ 
er  and  oscilloscopic  outputs. 

Three  background  noise  spectra  were  used  in  the  study.  Atten¬ 
tion  was  once  again  paid  to  the  influence  of  absolute  level  on 
detectability  since  Stevens  (Reference  32)  suggested  "the 
critical  duration  (or  endpoint  of  energy  integration)  appears 


31 

Levitt,  H.,  and  L.  R.  Rabiner,  USE  OF  A  SEQUENTIAL  STRATEGY 
IN  INTELLIGIBILITY  TESTING,  Journal  of  the  Acoustical  Society 
of  America,  42,  No.  3,  1967. 

32 

Stevens,  S.  D.  G.,  AUDITORY  TEMPORAL  INTEGRATION  AS  A  FUNC¬ 
TION  OF  INTENSITY,  Journal  of  Sound  and  Vibration,  30(1), 
109-126,  1973-  21 
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to  become  shorter  ™ith  increasing  overall  level". 


Table  3  shows  the  spectrum  levels  of  the  Gaussian  masking  noise 
used  in  this  study.  Sinusoids  at  1000,  250  and  125  Hz  were 
detected  in  noise  at  a  spectrum  level  of  40  dB.  Sinusoids  at 
125  and  h 0  Hz  were  detected  in  noise  at  a  spectrum  level  of  60 
dB,  and  tones  at  63  Hz  were  detected  in  noise  at  a  spectrum 
level  of  55  dB.  Table  3  also  shows  the  assumed  masking  band- 
widths  at  each  frequency,  derived  from  the  data  of  the  previous 
study  conducted  for  the  Applied  Technology  Laboratory  (Reference 
27). 


TABLE  3.  MASKING  NOISE  SPECTRUM  LEVELS 


Tone 

Frequency 

(Hz) 

Assumed 

Masking 

Bandwidth 

(Hz) 

Spectrum 

Level 

(dB  re:  20pPa) 

Masking 

Noise 

Bandwidth 

(Hz) 

Masking 
Noise 
Over¬ 
all  SPL 
(dB  re : 
20yPa) 

40 

90 

60 

17-400 

85.8 

63 

66 

55 

17-400 

80.8 

125 

70 

60 

17-400 

85.8 

125 

70 

40 

17-2000 

73.0 

250 

66 

40 

17-2000 

73.0 

1000 

153-3 

40 

17-2000 

73.0 

*Taken  from  Reference  27,  Table  III,  Page  45. 
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Observers  detected  the  sinusoids  in  blocks  of  r.. 
forced  choice  trials.  An  individual  trial  cent  a  irw  i  ’  v. 
tion  Intervals  of  a  duration  specified  by  the  ex;  — r  im-v.-  .  : 
observation  intervals  were  separated  by  b'JO  nilil.  ec'rdo. 
sponse  interval  of  750  milliseconds  followed  the  i 

tion  interval. 


During  one  of  the  two  observation  intervals,  a  sinusoid  at 
tone  frequency  was  embedded  in  coni  Inuous  rack, -round  n 
Observers  were  instructed  to  s  .n~-  of  two  i*  gi.ted  swir. 
at  this  time  to  indicate  the  observation  interval  in  which 


sinusoid  was  thought  to  have  occurred.  If  a  response  wa 
during  the  response  interval,  feedback  was  supplied  ••  • 
observer  by  lighting  the  response  switch  corresponding  * 
interval  in  which  the  tone  actually  occurred.  An  invar* 


period  of  500  milliseconds  separated  successive  trials. 


To  encourage  conscientious  performance,  a  bonus  was  aw.-.r.U 
each  block  of  trials  based  on  the  observer's  dc toot  i  on  per 
mance  relative  to  the  average  detection  performance  at  a  in¬ 
frequency  and  duration.  Observers  were  tested  in  daily  2- 
sessions  with  rest  breaks  approximately  every  20  minutes. 

ing  continued  over  a  period  of  several  weeks  under  fm. 
tening  conditions  in  an  anecholc  chamber.  Figure  1  is  a  b 
diagram  of  the  equipment  used  to  conduct  the  experiment, 
structlons  to  test  subjects  may  be  found  in  Appendix  A. 


23 


Ultra-Lo* 
Band  Pass 
Filter 


Figure  1.  Block  Diagram  of  Circuitry  for  Experiment. 


RESULTS 


•W 


PILOT  STUDY 


Preliminary  data  used  to  estimate  the  signal  to  noise  ratios 
necessary  for  7 6  percent  correct  detection  were  obtained  at 
four  different  durations  of  sinusoids  at  124  Hz,  from  100  to 
1000  milliseconds.  Psychometric  functions  were  estimated  for 
each  duration  over  a  range  of  signal  to  noise  ratios.  The 
signal  to  noise  ratios  were  adjusted  to  the  performance  of  each 
observer.  As  a  rule,  signal  to  noise  ratios  slowly  decreased 
over  successive  blocks  of  trials  to  avoid  a  sudden  degradation 
in  detection  performance. 

Signal  to  noise  ratios  were  generally  lowered  in  1-dB  steps 
until  performance  fell  below  90  percent  correct  detection. 
Further  adjustments  were  made  in  0.5-dB  steps.  However,  it  was 
frequently  necessary  to  repeat  signal  to  noise  ratio  conditions 
several  times  to  obtain  stable  estimates  of  performance  at  a 
given  level,  and  least  square  regression  lines  at  each  frequency 
and  signal  duration  were  frequently  based  on  a  small  number  of 
data  points  due  to  rapid  degradation  in  detection  performance 
by  observers.  Thus,  the  major  finding  of  the  pilot  study  was 
the  need  for  a  more  efficient  psychophysical  procedure  to  be 
used  in  the  main  study. 

MAIN  STUDY 


The  signal  to  noise  ratios*  for  76  percent  correct  detection 
performance  (corresponding  to  d1  *  1)  averaged  across  the  five 
observers  are  shown  in  Figures  2  through  6.  Table  4  shows 
results  for  individual  observers.  The  signal  to  noise  "ratios" 


*See  Appendix  C. 
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Figure  2 


W  200  500  750  1000 

Signal  duration  in  milliseconds 


Average  Signal  to  Noise  Level  at  76 %  Correct 
Detection  With  Varying  Signal  Durations  at 
40  Hz . 
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>00  200  500  750  1000 

Signal  Duration  in  milliiacondi 


Figure  4.  Average  Signal  to  Noise  Level  at  76%  Correct 
Detection  With  Varying  Signal  Durations  at 
125  Hz. 


Signal  to  Noise  Ratio  in  dB  at  d' 


1 


Figure  5.  Average  Signal  to  Noise  Level  at  76%  Correct 
Detection  With  Varying  Signal  Durations  at 
250  Hz. 
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M  100  ZOO  500  750  10CO 

Signal  Duration  in  milliiocondi 

Figure  6.  Average  Signal  to  Noise  Level  at  7670  Correct 
Detection  With  Varying  Signal  Durations  at 
1000  Hz. 


TABLE  4.  SIGNAL  TO  NOISE  RATIOS  FOR  76°/;  CORRECT  DETECTION 
PERFORMANCE  FOR  ALL  OBSERVERS 
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are  based  on  the  sound  pressure  level  of  the  sine  wave  at  each 
frequency  minus  the  average  spectrum  level  (dB  per  Hertz)  of  the 
noise  over  the  estimated  masking  bandwidths  of  Reference  27  (see 
Table  3)-  Notice  that  as  signal  durations  at  all  frequencies 
increase,  the  signal  to  noise  ratios  required  for  a  constant  level 
of  detection  performance  decrease. 

Figures  2  through  6  show  two  least  square  regression  lines.  The 
first  regression  line  is  drawn  through  the  three  shortest  dura¬ 
tion  points  that  fall  below  the  expected  Integration  time  of 
the  ear.  The  second  line  is  drawn  through  the  three  points 
presented  at  durations  longer  than  the  expected  integration  time. 
In  all  cases,  the  signal  to  noise  ratio  for  76  percent  correct 
detection  performance  decreases  as  the  duration  of  the  signal 
increases . 

Effective  masking  bandwidths  were  estimated  for  six  frequencies 
with  signal  durations  of  750  milliseconds  in  the  previous  study 
for  the  Applied  Technology  Laboratory  (Reference  27).  Sinusoids 
presented  at  750  milliseconds  were  included  in  the  current  study 
along  with  five  other  durations  of  signal  in  order  to  compare 
the  results  of  the  two  studies.  Figure  7  compares  the  average 
signal  to  noise  ratios  when  d'  =  1  with  a  signal  duration  of 
750  milliseconds  for  the  previous  and  current  studies  for  the 
Applied  Technology  Laboratory. 

Figure  7  also  shows  the  95  percent  confidence  interval  about  the 
mean  when  d'  *  1  at  each  frequency.  As  can  be  seen  from  Figure 
7,  the  differences  at  higher  frequencies  (above  125  Hz)  are 
unlikely  to  have  arisen  by  chance  alone.  No  simple  explanation 
can  be  provided  for  the  differences  at  the  higher  frequencies. 
However,  greater  consideration  should  be  given  to  the  mean  values 
found  in  the  current  study,  due  to  the  increased  precision 
afforded  by  the  psychophysical  procedure  used  in  the  current 
study . 
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Signal  to  Noise  Ratio  at  76%  Correct  Detection  Level 


O  1st  Stud/  (Average) 

•  2nd  Study  (Average) 

J  95%  Confidence  Interval 


\  \ 


1 


1 


63  125  250 

Frequency  in  Hz 


1000 


Figure  7.  Comparison  of  Signal  to  Noise  Ratios  at  76%  Level 
at  Different  Frequencies  During  1st  Study  and  2nd 
Study  With  750  Millisecond  Duration. 
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The  effect  of  the  duration  of  the  signal,  which  was  of  most 
interest  in  the  study,  was  examined  by  plotting  the  signal  to 
noise  ratios  as  functions  of  signal  frequency  and  duration.  The 
expectation  was  that  at  shorter  signal  durations,  signals  of 
equal  energy  (defined  as  the  signal  sound  pressure  level  plus  10 
times  the  logarithm  of  the  signal  duration)  are  equally  detectable. 

Figure  8  shows  regression  lines  through  the  signal  to  noise  "ratios" 
corresponding  to  d1  =  1  points  averaged  across  the  five  observers. 
The  metric  of  the  ordinate,  E/N  ,  is  the  signal  energy  in  decibels 
(sound  pressure  level  plus  10  times  the  log  of  the  duration  of 
the  signal  in  seconds)  minus  the  spectrum  level  of  the  noise  in 
decibels  (sound  pressure  level  in  a  1-Hz  wide  band).  Table  5 
contains  the  E/Nq  values  at  the  76  percent  correct  detection 
level  for  all  observers  at  five  frequencies.  These  values  differ 
from  those  in  Table  A  only  by  the  10  log  duration  correction. 

The  E/Np  values  for  each  observer  plotted  as  functions  of  the 
log  of  the  signal  duration  may  be  found  in  Appendix  B. 

The  data  collected  at  signal  durations  of  50,  100  and  200  milli¬ 
seconds  suggest  that  detection  performance  with  signal  durations 
shorter  than  the  auditory  integration  time  depends  on  the  signal 
energy  alone.  A  horizontal  line  is  drawn  through  the  average 
of  the  three  data  points  in  Figure  8  at  the  shortest  durations. 

The  mean  deviation  about  the  horizontal  line  is  only  0.25  dB 
for  all  frequencies. 

For  signal  durations  in  excess  of  the  aural  integration  time, 
prior  research,  along  with  mathematical  models  of  the  detection 
process,  suggested  that  detection  performance  is  based  on  a 
vector  summation  of  the  signal  detectability  (d')  in  sequential 
nonoverlapping  integration  periods.  That  is,  if  the  signal 
duration  is  longer  than  the  aural  integration  time,  the  listener 
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Figure  8.  YJ 


Signal  Duration  in  milliseconds 


Figure  8.  Continued. 
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will  (1)  divide  the  total  signal  duration  Into  N  segments ,  each 
of  which  is  only  one  integration  period  long,  (2)  compute  the 
signal  detectability  separately  for  each  segment,  and  (3)  compute 
the  composite  detectability  for  the  entire  signal  by 

d'c  =  CCdV2  +  (d’2)2  +  ...  (d'N)2]1/2 

Under  such  a  detection  model  the  signal  energy  required  for 
equal  detectability  increases  at  the  rate  of  5  dB  per  multiple  of 
10  times  duration.  A  best  fitting  regression  line  of  this  slope 
is  shown  for  the  data  at  the  500,  750  and  1000  millisecond  dura¬ 
tions  at  each  frequency.  The  intersection  point  of  the  two  line 
segments  provides  the  best  estimate  of  the  aural  integration  tir.-. 
at  each  frequency.  At  63  Hz  the  aural  integration  time  appears 
to  be  less  than  that  at  125  Hz  and  above.  However,  the  estim.v. o 
of  integration  time  at  ^0  Hz  provided  by  the  intersection  of  ’ 
two  line  segments  is  almost  equal  to  the  integration  time  a- 
1000  Hz. 

The  difference  in  E/Nq  values  for  detection  of  signals  of  750 
milliseconds  duration  (the  duration  used  in  Reference  27)  and  f:r 
those  within  the  constant  energy  regime  was  of  central  interest 
in  this  study. 

Table  6  shows  the  AE  values,  or  the  difference  in  E/No  values 
found  with  a  750  millisecond  signal  duration  and  the  average 
E/Nq  value  over  the  50,  100,  and  200  millisecond  durations  (the 
equal  energy  regime).  The  average  AE  value  is  also  shown  for 
each  frequency  in  Figure  8.  Note  that  at  125,  250  and  1000  Hz 
the  mean  differences  are  relatively  small  (2.2  dB).  Deviations 
about  this  mean  were  only  0.5  dB.  Using  Fletcher's  critical 
ratio,  such  small  differences  produce  only  minor  changes  in  mask¬ 
ing  bandwidths  estimated  by  the  critical  ratio  assumption. 
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The  largest  difference  occurred  at  63  Hz,  where  a  3.6  dli  dif¬ 
ference  from  the  E/N  value  found  at  750  millisecond;-  durable 
and  the  constant  energy  average  was  observed.  However,  ni  '* 
Hz  the  difference  was  2.7  dB,  identical  to  the  difference 
observed  at  1000  Hz. 

TABLE  6.  AE  VALUES  AS  A  FUNCTION  OF  FREQUENCY,  IN  dB 

Sub.i  ect 


Frequency 

1 

2 

3 

4 

5 

X 

0 

40 

1.4 

3-4 

1.7 

4.5 

2.3 

2.7 

i  .  ; 

63 

3.5 

3-2 

- 

4.0 

3.8 

?  f 

125 

2.2 

0.8 

2.8 

3.2 

0.8 

2 . 0 

] .  i 

250 

1.3 

1.9 

2.2 

2.5 

1.8 

1 . 9 

C  .  - 

1000 

2.6 

2.5 

- 

3.5 

1.9 

2.7 

;  .  - 

Figure  9  shows  the  average  AE  values  found  at  the  five  frequen¬ 
cies  and  the  95  percent  confidence  interval  around  each  mean 
value.  The  lack  of  systematic  differences  in  AE  values  over  the 
frequency  range  of  interest  suggests  the  absence  of  meaningful 
duration  effects  as  a  function  of  frequency. 


Sinusoids  of  125  Hz  were  presented  at  all  six  durations  in  a  back¬ 
ground  noise  spectrum  level  of  40  dB.  The  same  signals  were  als-' 
detected  at  200  and  750  millisecond  durations  in  a  60  dB  back¬ 
ground  noise  spectrum  level.  The  average  difference  in  detec¬ 
tion  performance  observed  between  the  two  noise  level  cond.it  ions 
was  1.4  dB.  A  difference  of  1.6  dB  was  found  between  these 
background  spectrum  levels  in  Reference  28.  The  excellent 


agreement  of  these  tv.fo  findings  substantiates  the  effect  of 
absolute  level  of  the  background  noise  on  detection  perfor¬ 
mance  . 


40  63 
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DISCUSSION 


The  orderliness  of  the  data  shown  in  Figures  2  through  6  reflects 
the  precision  of  the  psychophysical  procedure  employed  in  this 
study.  For  signal  durations  less  than  the  aural  integration  time, 
data  points  never  deviate  more  than  0.25  decibels  from  the  least 
square  regression  line.  For  the  higher  signal  durations,  the 
maximum  deviation  from  the  regression  lines  is  less  than  0.5 
decibels.  Considering  that  each  data  point  was  determined  on 
the  basis  of  approximately  1500  two-alternative  trials,  this 
regularity  is  not  altogether  surprising. 

Initial  estimates  of  the  aural  integration  time  from  the  liter¬ 
ature  (approximately  200-300  milliseconds)  were,  in  retrospect, 
excellent  for  purposes  of  experimental  planning.  The  three 
signal  durations  chosen  on  either  side  of  this  value  fell  decided¬ 
ly  into  the  temporal  regimes  (shorter  and  longer  than  the  audi¬ 
tory  integration  time)  discussed  on  pages  32  et  seq.  Durations 
at  or  near  the  integration  time  appear  to  have  been  successfully 
avoided  as  planned. 

The  dashed  lines  shown  in  Figure  8  are  the  best  fits  of  the  data 
to  the  detection  model  for  the  temporal  regime  greater  than  the 
auditory  integration  time.  Note  that  the  ordinate  in  Figure  8 
is  signal  energy ,  not  signal  level.  Under  the  model  the  signal 
energy  required  for  equal  detection  performance  should  remain 
constant  up  to  durations  equal  to  the  integration  time.  For 
greater  durations,  the  required  signal  energy  should  increase  at 
a  rate  of  1.5  dB  per  doubling  of  duration.  The  dashed  lines  of 
these  slopes  appear  to  fit  the  data  quite  well,  with  maximum 
deviations  of  0.5  dB.  The  intersection  of  these  two  lines 
provides  the  best  estimate  of  the  aural  integration  time.  Note 
that  at  all  frequencies  except  63  Hz,  the  estimates  fall  in  the 
range  of  220  to  310  milliseconds,  but  with  no  apparent  sys¬ 
tematic  dependency  on  frequency.  The  exception,  63  Hz,  actually 
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estimates  an  integration  time  (1-41  milliseconds)  which  is  in 
the  constant  energy  regime  and  is  in  all  probability  unrealisti¬ 
cally  low.  There  is,  unfortunately,  no  immediate  explanation 
for  this  anomaly. 

Upon  more  careful  inspection  of  the  500,  750  and  1000  milli¬ 
seconds  data  points  in  Figure  8,  it  appears  that  a  slightly 
steeper  slope  (perhaps  2.0  to  2.3  dB  per  doubling)  might  provide 
a  better  fit  to  the  data.  This  slope  would  alleviate  the  anomaly 
found  at  63  Hz,  but  would  require  a  reassessment  of  the  assumed 
detection  model  at  higher  durations.  To  put  this  slope  into 
some  perspective,  1.5  dB/doubling  provides  the  very  best  de¬ 
tection  performance  when  the  observer  uses  all  of  the  informa¬ 
tion  over  the  total  signal  duration  with  perfect  recall  of 
assessed  detectability  in  each  temporal  subdivision.  In  con¬ 
trast,  3.0  dB/doubling  would  be  expected  if  the  observer  listened 
for  only  one  integration  period,  made  his  judgment,  and  complete¬ 
ly  ignored  the  remainder  of  the  signal  duration.  Thus,  a  slope 
of  2.0/doubllng  indicates  that  the  observers  are  still  able 
to  perform  the  temporal  subdivision  process,  but  may  either 
(1)  forget  or  underrate  the  assessed  detectability  of  the 
signal  at  Its  onset,  or  (2)  become  somewhat  less  vigilant  in 
the  detection  task  toward  the  end  of  the  signal. 

The  comparison  of  signal  to  noise  ratios  in  Figure  7  suggests 
non-negllgible  differences  between  results  obtained  in  Reference 
27  and  the  current  study.  With  the  exception  of  ^0  and  63  Hz, 
data  from  the  two  studies  do  not  fall  within  each  other's  95# 
confidence  intervals.  Although  the  different  trial  procedures 
of  the  two  studies  might  account  for  some  of  this  difference, 
it  seems  unlikely  that  the  differences  would  be  frequency 
dependent.  The  increased  precision  of  measurement  of  the  psycho¬ 
physical  procedure  used  in  the  current  study  Is  thought  to  pro¬ 
vide  more  reliable  and  stable  results  than  those  of  Reference  27 . 


Table  7  compares  estimated  masking  bandwidths  for  the  current- 
study  and  Reference  27.  These  estimates  are  based  on  apnlicat 1 
of  Fletcher's  critical  ratio  assumptions  to  the  data  of  Table  7 
(in  the  case  of  Reference  27)  and  Figure  7  (in  the  case  of  the 
current  data).  The  two  sets  of  estimates  are  in  considerable 
disagreement  at  frequencies  below  125  Hz. 


TABLE  7.  BANDWIDTHS  OF  HYPOTHETICAL  AUDITORY  FILTER  INFERRED  b\ 
FLETCHER'S  CRITICAL  RATIO  METHOD 


Center  Frequency  (Hz) 

Estimated  Masking 
(Reference  27)  (Hz) 

Bandwidth  Prestn* 
Study  (Hz) 

1000 

153* 

]  *  /  ‘V  * 

250 

66 

7  2 

125 

70 

82 

63 

66 

170 

40 

90 

f 

L. .  '• 

•Forced  to  agree  with  value  assumed  in  Reference  30. 

It  seems  clear  that  the  low  frequency  masking  bandwidth  estimate 
of  the  present  study  are  simply  not  credible.  For  the  hypothe¬ 
tical  auditory  filter  centered  at  40  Hz  to  be  260  Hz  wide,  it 
would  have  to  be  far  more  asymmetric  than  it  is  known  to  be  on 
the  basis  of  other  research  (Reference  29).  For  that  matter, 
the  estimate  of  the  masking  bandwidth  at  40  Hz  from  Reference 
27  (90  Hz)  is  also  implausibly  wide. 

The  obvious  problem  is  the  assumed  reference  bandwidth  of  153  H: 
at  1000  Hz.  This  value  was  chosen  to  agree  with  the  value  incor 
porated  in  the  acoustic  detection  range  software  described  In 


Reference  30.  Abrahamson  in  turn  adopted  this  value  from  the 
research  of  Greenwood  (Reference  33).  Patterson  (Reference  34) 
has  since  shown  that  Greenwood’s  estimate  is  too  large  by  a 
factor  of  at  least  two,  due  to  inappropriate  assumptions  about 
the  detection  strategies  used  by  the  observers  in  Greenwood's 
experimentation. 

A  more  generally  accepted  value  for  the  effective  masking  band¬ 
width  at  1000  Hz  is  about  65  Hz.  Applying  Fletcher's  critical 
ratio  assumptions  to  the  current  data  with  this  reference  value 
produces  a  masking  bandwidth  estimate  of  35  Hz  at  125  Hz,  a  figure 
in  good  agreement  with  the  estimate  of  28  Hz  from  Reference  29- 
Since  no  testing  was  conducted  at  frequencies  below  125  Hz  in 
Reference  29,  it  is  not  possible  to  determine  how  meaningful  the 
wider  bandwidth  estimates  obtained  in  the  present  study  at  63 
and  40  Hz  may  be. 

These  anomalies  in  the  frequency  domain,  while  meriting  further 
study  in  their  own  regard,  do  not  affect  the  overall  conclusion 
that  can  be  drawn  from  the  current  study  in  the  temporal  domain. 
There  is  no  reason  to  modify  the  prediction  model  of  Reference  30 
on  the  basis  of  the  empirical  evidence  of  systematic  differences 
in  temporal  integration  intervals  at  low  frequencies. 


33Greenwood,  D.  D.,  AUDITORY  MASKING  AND  THE  CRITICAL  BAND, 
Journal  of  the  Acoustical  Society  of  America,  33,  484-502, 

^Patterson,  R.  D.,  AUDITORY  FILTER  SHAPES  DERIVED  WITH  NOISE 
STIMULI,  Journal  of  the  Acoustical  Society  of  America, 

59,  640-654,  197^ 
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CONCLUSIONS 


1)  The  signal  to  noise  ratio  that  human  observers  require  to 
maintain  a  constant  level  of  detection  performance  (percent 
correct  decisions)  decreases  as  the  duration  of  the  signal  to 
be  detected  increases  from  50  to  1000  milliseconds. 

2)  Detection  performance  of  human  observers  listening  for 
acoustic  signals  embedded  in  noise  can  ba  modeled  as  a  simple 
energy  integration  for  signal  durations  less  than  about  a 
quarter  of  a  second  (x  in  Eq.  1,  p.  8  =  240  milliseconds). 

3)  For  present  purposes,  the  integration  time  of  the  human 
auditory  system  (x)  may  be  regarded  as  invariant  over  the 
frequency  range  of  40  to  1000  Hz.  Thus,  no  modification  to 
ATL's  acoustic  detection  range  prediction  software  is  needed 
to  account  for  differences  in  the  short  term  (less  than  a 
quarter  of  a  second)  detectability  of  acoustic  signals  of 
different  frequencies  between  40  and  1000  Hz. 

4)  Low  frequency  masking  bandwidths  inferred  from  the  data 
of  the  present  study  are  wider  than  those  inferred  from  the 
data  of  Ref.  27.  Both  sets  of  estimates  are  very  sensitive 
to  small  changes  (+  1  dB)  in  signal  to  noise  ratios  needed  to 
maintain  constant  detection  performance  at  low  frequencies. 
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RECOMMENDATIONS 


Because  ATL’s  acoustic  detection  range  prediction  software 
is  insensitive  to  signal  duration,  it  predicts  the  same 
detection  range  for  both  instantaneously  and  continuously 
audible  signals.  The  results  of  the  present  experimentation 
clearly  show  that  signals  of  greater  duration  require  lower 
signal  to  noise  ratios  to  be  detected  with  the  same  pro¬ 
bability  as  signals  of  shorter  durations.  Analyses  of  the 
present  findings  show  two  relationships  between  signal  to 
noise  ratios  needed  for  constant  detection  performance  as 
a  function  of  signal  duration: 

1.  For  signal  durations  less  than  the  integration  time 
(t),  human  detection  may  be  modeled  as  a  simple  energy 
integration  in  which  process  signals  of  equal  energy 
(product  of  amplitude  and  duration)  are  equally  detectable. 
Signal  levels  required  for  constant  detection  performance 
therefore  decrease  by  3  dB  for  every  doubling  of  duration. 

2.  For  signal  durations  greater  than  the  auditory  inte¬ 
gration  time,  human  detection  may  be  modeled  as  a  temporal 
subdivision  process  in  which  detectability  ( d ’ )  is  in¬ 
dependently  assessed  in  sequential,  nonoverlapping  inte¬ 
gration  periods  and  is  orthogonally  vector  summed.  Signal 
levels  required  for  constant  detection  performance  therefore 
decrease  at  a  rate  of  1.5  dB  for  every  doubling  of  duration. 

If  incorporated  in  ATL’s  software,  these  relationships  would 
improve  the  generality  of  its  prediction  algorithms. 
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APPENDIX  A 
TEST  PROCEDURE 

The  trialwise  adaptive  two-alternative  forced  choice  sirual 
detection  procedure  begins  with  a  conversation  between  the 
experimenter  and  the  computer-based  system.  The  system  prompt ; 
the  experimenter  for  information  needed  to  conduct  the  experi¬ 
ment  by  inquiring  about  the  conditions  for  data  collection.  7h 
program  begins  by  typing  a  one-line  identifier  and  waiting  for 
the  experimenter  to  type  one  of  three  commands.  A  "C"  diroc' r 
the  program  to  enter  calibration  mode.  The  experimenter  is 
asked  for  the  data  and  experimenter  identification.  The  program 
then  asks  for  specific  information  about  the  signal  and  th 
background  noise  and  allows  the  experimenter  to  adjust  the  level 
of  each.  The  experimenter  adjusts  the  level  of  the  signal  to 
20  dB  above  the  level  at  which  it  is  barely  audible.  When  the 
calibration  procedure  is  complete  the  program  returns  to  command 
mode . 

The  second  command,  an  "M",  allows  the  experimenter  to  change 
the  various  operating  parameters  of  the  program.  The  program 
presents  the  current  values  of  the  parameters  for  confirmation 
and/or  modification.  The  experimenter  may  change  the  three 
trial  parameters  which  define  the  length  (in  msec)  of  the  obser¬ 
vation,  intratrial  and  response  intervals.  The  experimenter 
then  specifies  the  maximum  number  of  trials  to  be  presented  in 
a  given  block  of  trials,  and  the  initial  step  size  and  the 
smallest  step  size  in  decibels.  The  experimenter  can  also 
request  paper  tape  output . 

Next,  the  experimenter  specifies  stopping  criteria:  the  desired 
size  of  the  95%  confidence  interval  in  decibels  and  the  maximum 
number  of  reversal  cycles.  The  experiment  terminates  either 


when  it  reaches  a  criterion  confidence  interval  or  by  completing 
a  specified  number  of  reversal  cycles  in  responses. 

The  bonus  parameters  may  also  be  altered  by  the  experimenter. 

The  experimenter  defines  the  bonus  in  cents  to  be  paid  for  an 
average  performance  by  the  test  subject  as  well  as  the  addi¬ 
tional  payoff  for  performance  that  falls  above  or  below  average. 

A  differential  payoff  rate  is  calculated  in  cents  per  decibel 
deviation  from  average  performance.  The  experimenter  also  limits 
the  maximum  bonus  paid  to  the  test  subject.  The  program  does 
not  permit  a  negative  bonus  to  be  computed. 

Following  definition  of  the  operating  parameters,  the  experi¬ 
menter  proceeds  to  actual  running  of  the  program  by  typing  an 
"R"  for  Run.  The  experimenter  enters  the  subject  identification 
and  specifies  a  starting  signal-to-noise  ratio  in  dB.  The 
program  can  continue  to  operate  automatically  for  a  number  of 
runs  as  determined  by  the  experimenter.  The  experimenter's 
last  input  to  the  program  is  signal  to  noise  ratio  at  the 
detection  level  of  Interest.  A  heading  is  printed  on  the  tele¬ 
type  for  Information  supplied  to  the  experimenter  following  each 
block  of  trials. 

Finally,  the  experimenter  types  a  ’space'  to  start  the  experiment. 
This  causes  the  "Start"  button  on  the  subject's  response  box  to 
light  up.  After  depression  of  the  "Start"  button,  a  grid  on  a 
storage  oscilloscope  is  plotted  and  the  experiment  commences. 

This  display  provides  a  real-time  plot  of  two  experimental 
parameters:  the  level  of  the  signal  and  the  performance  of  the 

subject.  Figure  A-l  shows  a  sample  of  the  grid  displaying  the 
level  of  the  signal  on  each  of  the  two-alternative  forced  choice 
trials.  For  each  signal  presentation,  the  display  Indicates 


SIGNAL /NOISE  <DB>  SIGNRL/NOISE  (DB) 


whether  the  response  was  correct  -  "C/R",  incorrect  -  "I/R", 
or  that  there  was  no  response  -  "N/R"  by  placing  a  circle  in 
the  proper  area  of  the  grid.  The  signal  level  for  the  next 
trial  is  displayed  on  the  grid  just  before  it  is  presented  to 
the  test  subject.  The  performance  of  the  subject  is  shown  at 
the  top  of  the  grid. 

The  osc illoscopic  display  shows  the  level  of  the  signal  and  the 
response  of  the  subject  for  the  entire  block  of  trials.  The 
signal  is  initially  presented  at  a  very  high  level  so  that  it 
is  clearly  audible.  Adjustments  to  the  signal  to  noise  ratio 
are  initially  large,  but  are  halved  after  the  first  reversal  of 
direction  of  adjustment  until  a  minimum  step  size  criterion  is 
reached . 

After  two  complete  reversal  cycles  the  program  checks  to  deter¬ 
mine  if  It  lias  reached  a  minimum  step  sir.*.-  as  specified  by  the 
experiment  er .  If  the  current  step  sire  equals  the  minimum  st 
sire,  the  computer  considers  the  first  two  reversal  cycles  as 
training  and  proceeds  to  cumulate  statistics  of  detection  per¬ 
formance.  If  the  step  sire  is  above  the  minimum  step  sire,  tie- 
computer  continues  with  another  cycle  of  training  trials  until 
a  reversal  is  reached.  A  minimum  cf  two  training  cycles  is 
required  at  the  beginning  of  each  Mock. 

The  first  21  trials,  shown  in  Figure  A-l,  were  treated  as 
training  trials.  Following  two  reversal  cycles  the  com.ru*  er 
began  cumulating  response  data.  The  subjec*  completed  7 
reversal  cycles  in  the  remaining  7  trials  of  the  Mo- ok .  Tie 
computer  terminated  the  block  after  M  trials  wh*  r;  It  J*-t  err.!*:* 
that  there  were  too  few  remaining  trial-  i:  tie-  1  1  -o  c  f  1  '  *  - 
complete  another  reversal  ey  •  ]** . 


The  experimenter  may  halt  the  experiment  while  the  program 
is  running  in  four  ways:  typing  a  "T"  causes  the  experiment 
to  stop  after  the  next  trial;  typing  a  "C"  causes  the  experi¬ 
ment  to  halt  after  completion  of  the  next  reversal  cycle; 
typing  a  "B"  causes  the  program  to  halt  after  the  next  block 
of  trials;  and  typing  an  "R"  halts  the  experiment  after  com¬ 
pletion  of  the  next  run.  The  experimenter  may  cancel  any  of 
the  operator  intervention  commands  by  typing  a  'space'  before 
the  experiment  halts.  After  the  experiment  stops,  statistics 
cumulated  up  to  the  point  of  stopping  for  the  block  are  printed 
on  the  teletype.  However,  the  statistics  are  not  cumulated  for 
output  on  the  line  printer  until  the  experimenter  indicates 
whether  to  keep  or  discard  the  current  block.  The  experimenter 
then  elects  either  to  continue  or  end  the  run. 

Statistics  are  printed  on  the  line  printer  for  each  reversal 
cycle.  Figure  A-2  shows  an  example  of  the  output  to  the  line 
printer  for  a  block  of  trials.  The  output  first  indicates  the 
number  of  training  trials  for  each  block.  For  each  reversal 
cycle,  the  beginning  and  ending  time  are  printed  along  with 
the  mean  signal  to  noise  ratio  for  the  cycle,  the  number  of 
trials  before  a  reverse  response  occurred  and  the  number  of 
trials  in  which  the  subject  failed  to  make  a  response. 

Cumulative  statistics  for  the  blocks  are  provided  starting 
with  the  first  reversal  cycle.  The  mean  signal  to  noise 
ratio,  95  percent  confidence  interval,  and  the  percent  correct 
in  interval  one  and  interval  two  are  cumulated  until  the  end 
of  the  block. 
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Figure  A-2.  Sample  Line  Printer  Output 


TEST  INSTRUCTIONS 


This  experiment  will  test  your  ability  to  hear  a  faint  signal. 
The  signal  will  always  occur  in  distinct  trials.  Each  trial  is 
composed  of  two  listening  intervals.  The  buttons  in  front  of 
you  labelled  "1"  and  "2"  light  up  during  the  first  and  second 
listening  intervals  of  each  trial,  respectively.  After  lister.in 
during  the  two  intervals,  your  job  is  to  push  either  3utton  "1" 
or  "2",  corresponding  to  the  listening  interval  in  which  you 
think  the  signal  has  occurred. 

The  signal  you  are  listening  for  will  occur  equally  often  in 
each  listening  interval,  but  in  random  sequence.  Since  there  is 
no  pattern  whatsoever  to  the  order  in  which  the  signal  occurs 
in  the  two  listening  intervals,  you  must  base  your  decision 
solely  upon  what  you  hear. 

A  constant  background  of  noise  will  be  playing  in  the  chamber  at 
all  times.  The  signal  will  be  easy  to  hear  when  the  test  first 
begins.  The  level  of  the  signal  will  be  gradually  adjusted  accc 
ding  to  your  correct  or  incorrect  decisions.  When  you  make  a 
correct  response,  the  level  of  the  signal  is  lowered.  When  you 
make  an  Incorrect  response,  the  level  of  the  signal  is  raised, 
some  trials  the  level  of  the  signal  will  be  so  low  that  you  will 
have  difficulty  deciding  which  listening  interval  contained  the 
signal . 

Regardless  of  how  confident  you  feel  about  your  decision,  you 
must  push  one  of  the  two  buttons  on  each  trial.  You  have  at 
least  a  50  percent  chance  of  getting  a  correct  answer  just  by 
guessing. 
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If  you  do  not  make  a  response,  the  level  of  the  signal  remains 
the  same.  This  prolongs  the  length  of  the  experiment  consi¬ 
derably.  Remember  that  the  signal  always  occurs  during  either 
interval  1  or  2  of  each  trial,  even  if  you  are  not  certain  of 
the  interval  in  which  it  occurs. 

You  must  respond  in  the  one  second  interval  following  the  presen¬ 
tation  of  the  two  listening  intervals.  Do  not  respond  before 
you  see  the  light  corresponding  to  interval  2  go  out.  Directly 
after  your  response  on  each  trial,  the  button  corresponding  to 
the  listening  interval  in  which  the  sound  actually  did  occur  will 
light  up  briefly.  This  will  let  you  know  immediately  whether 
your  decision  was  correct  or  incorrect. 

An  experimental  session  continues  until  your  percentage  of 
correct  decisions  has  reached  a  predetermined  level.  In  order 
to  end  an  experimental  session  as  rapidly  as  possible,  it  is 
therefore  necessary  for  you  to  listen  very  carefully  on  each 
trial,  and  to  make  your  decision  as  best  you  can. 


APPENDIX  B 

GRAPHICAL  SUMMARY  OF 
INDIVIDUAL  SUBJECT  DATA 


This  appendix  contains  plots  of  detection  data  for  each  test 
subject.  Tabular  data  from  these  graphs  may  be  found  in  Table 
5  of  this  report. 
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Figure  B-2.  E/Nq  Vs  Duration  Curves  for  Subject  2 
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Figure  B-2.  Continued. 
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Figure  B-4.  E 


for  Subject  4 
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Figure  B-4.  Continued. 
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Figure  B-5.  Continued. 
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APPENDIX  C 

TERMINOLOGY  FOR  SIGNAL  TO  NOISE  RATIO  MEASUREMENTS 


Throughout  the  text  of  this  report,  signal  to  noise  ratio  quanti¬ 
ties  are  expressed  in  different  forms  (e.g.,  S/N,  E/N  ,  or  in 
words)  as  appropriate  to  the  context  of  the  discussion.  In  all 
cases,  thf  Intent  of  the  terminology  is  to  express  the  division 
of  a  quantitative  measure  of  signal  intensity  by  a  quantitative 
measure  of  noise  Intensity.  The  bandwidth  of  the  signal  measure¬ 
ment  is  implicitly  taken  to  be  1  Hz,  since  the  signals  of  Interest 
are  pure  sinusoids.  The  bandwidth  of  the  noise  measurement  is 
also  1  Hz,  unless  otherwise  stated. 

Consider,  for  example,  the  quotient  of  signal  power  (in  watts) 
and  noise  power  (also  In  watts).  Dividing  signal  power  by  noise 
power  yields  the  dimensionless  ratio  referred  to  as  "signal  to 
noise  ratio".  This  ratio  can  also  be  expressed  in  decibels  by 
multiplying  the  logarithm  (base  10)  of  the  quotient  by  10,  as: 

ratio  in  decibels  =  10  log  (|)  =  10  log  (S)  -  10  log  (N) 

where:  10  log  (S)  =  signal  level  in  decibels 

10  log  (N)  =  noise  level  in  decibels. 

From  the  above  equation  It  is  clear  that  the  ratio  in  decibels  may 
be  obtained  by  simply  subtracting  the  noise  level  (in  decibels) 
from  the  signal  level  (In  decibels).  Since  sound  levels  (signal 
or  noise)  are  both  conveniently  measured  in  decibels,  it  Is  most 
convenient  to  express  their  power  ratio  in  decibels  as  well. 

Unless  otherwise  stated,  all  signal  to  noise  ratios  in  this  re¬ 
port  are  presented  In  decibels,  along  with  a  brief  description 
of  how  the  signal  and  noise  quantities  were  each  measured. 


